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There has been considerable recent interest, primarily motivated 
by problems in applied algebraic topology, in the homology of ran- 
dom simplicial complexes. We consider the scenario in which the ver- 
tices of the simplices are the points of a random point process in 
R d , and the edges and faces are determined according to some deter- 
ministic rule, typically leading to Cech and Vietoris-Rips complexes. 
In particular, we obtain results about homology, as measured via the 
growth of Betti numbers, when the vertices are the points of a general 
stationary point process. This significantly extends earlier results in 
which the points were either iid observations or the points of a Pois- 
son process. In dealing with general point processes, in which the 
points exhibit dependence such as attraction, or repulsion, we find 
phenomena quantitatively different from those observed in the iid 
and Poisson cases. From the point of view of topological data anal- 
ysis, our results seriously impact on considerations of model (non) 
robustness for statistical inference. 

1. Introduction. There has been considerable recent interest, primar- 
ily motivated by problems in applied algebraic topology, in the homology 
of random simplicial complexes. Two main scenarios have been considered. 
In the more common, the vertices of the simplices are a random point set, 
and the edges and faces are determined according to some deterministic 
rule, typically related to the distance between pairs, or general subsets, of 
vertices. This has lead, for example, to the random Cech and Vietoris-Rips 
complexes studied in papers such as [1, 18, 19]. 

Another approach has been to consider random subsets of complete graphs, 
leading to a number of papers dealing with the homology of Erdds-Renyi 
clique complexes, as in, for example, [12, 17, 22, 25]. 

The current paper is concerned with the first of these approaches, al- 
though with a novel and - from the point of view of both theory and ap- 
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plications - important change of emphasis. Previous papers on simplicial 
complexes built over random point sets have always assumed that the points 
were either independent, identically distributed (iid) observations from some 
underlying distribution on R d , or points of a (typically non-homogeneous) 
Poisson point process. Our aim in this paper is to investigate situations 
in which the points are chosen from a general point process, in which the 
points exhibit dependence such as attraction or repulsion. From the point 
of view of topological data analysis (TDA) our results, which show that lo- 
cal dependencies can have a major affect on the growth rates of topological 
quantifiers such as Betti numbers, impact on considerations of model (non) 
robustness for statistical inference in TDA. We shall not address these issues 
here, however, beyond a few comments in Section 1.3 below. 

To start being a little more specific, given a point process (i.e. locally finite 
random counting measure) $ on ~K d , recall that the random geometric graph 
G( < &, r), for r > 0, is defined as the graph with vertex set <3> and (undirected) 
edge set {(X,Y) G $ 2 :||A" — Y\\ < r}. The properties of random geometric 
graphs when $ is a Poisson point process or a point process of iid points 
have been analysed in detail (cf. [30]), and recently interest has turned to 
the richer topic of random simplicial complexes built over these point sets. 
We shall be concerned with Cech and Vietoris-Rips complexes. Let B x (e) 
denote the ball of radius e around x, and = {x±,X2, . . .} be a collection of 
points in 

Definition 1.1 (Cech complexes). The abstract simplicial complex C(&, e) 
constructed according to the following rules is called the Cech complex asso- 
ciated to <3? and e. 

1. The 0-simplices of C($,e) are the points in <3?, 

2. An n-simplex, or n-dimensional 'face', a = [x{ , . . . ,Xi n ] is in C(<J>,e) 
ifr\ n k = Q B Xik (e/2)^%, 

Definition 1.2 (Vietoris-Rips complexes). The abstract simplicial com- 
plex R($>, e) constructed according to the following rules is called the Vietoris- 
Rips complex associated to <£> and e. 

1. The 0-simplices of R($>,e) are the points in <£. 

2. An n-simplex, or n-dimensional face', a = [xi , . . . is in R(<&,e) 
if B Xik (e/2) n B x . m (e/2) + for every 0<k<m<n. 

More information on these complexes will be given in Section 4.1 when it 
is needed. Both of these (related) complexes are important in their own right, 
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with the Cech complex being of particular interest since it is known to be 
homotopy equivalent to the random Boolean set IJxe* ^x( e ) which appears 
in integral geometry (e.g. [31]) and continuum percolation (e.g. [24]). We 
shall concentrate in this paper on the ranks of the the homology groups 
- i.e. the Betti numbers - of these complexes in the random scenario. At 
a heuristic level, the fc-th Betti number, /3k, measures the number of k- 
dimensional cycles, or "holes", in the complex. 

A complementary approach to studying the topological structure of sim- 
plicial complexes is via (non-smooth) Morse theory, and here results for Pois- 
son process generated complexes are given in [7]. In fact, the local structure 
of Morse critical points is often more amenable to computation than the 
global structure of the Betti numbers. Thus we shall also take this route in 
parts of this paper. 

There are some recurring themes and techniques in the analysis of Betti 
numbers and Morse critical points, which are intimately related to the sub- 
graph and component counts of the corresponding random geometric graph. 
Thus, from the purely technical side, much of this paper will be concerned 
with the intrinsically interesting task of extending the results of [30, Chapter 
3] on subgraph and component counts of Poisson point processes to more 
general stationary point processes. Subgraph counts of a random geometric 
graph are an example of U-statistics of point processes. Hence, apart from 
their applications in this article, our techniques to study subgraph counts 
of random geometric graph over general stationary point processes could be 
useful to derive asymptotics for many other translation and scale invariant 
U-statistics of point processes (For example, the number of fc-simplices in a 
Cech or Vietoris-Rips complexes). Also, the results on subgraph counts are 
used to derive results about clique numbers, maximum degree and chromatic 
number of the random geometric graph on Poisson or i.i.d. point process ([30, 
Chapter 6]) and with a similar approach, our results can be used to derive 
asymptotics for clique numbers, maximum degree and chromatic number of 
random geometric graphs over general stationary point processes (see [4, 
Section 4.3.1]). This will take up all of Section 3, the longest section of the 
paper. From these results, we shall be able to extract results about Betti 
numbers (via combinatorial topology) as well as the numbers of Morse crit- 
ical points. In formulating results, we shall relate the topological features of 
the random simplicial complexes to known, inherent properties of the un- 
derlying point processes, including joint densities, void probabilities or Palm 
void probabilities. The first two of these properties, along with association 
properties, are known to be useful in studying measures of clustering, and 
their impact on percolation of random geometric graphs was studied in [5]. 
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Since our asymptotic results help quantify the impact of clustering measures 
such as sub-Poisson and negative association on topological features of point 
processes, they provide additional applications of the tools of [6] as measures 
of clustering. 

A sampler of some of our main results follows a little necessary notation. 

1.1. Some notation. We use |.| to denote Lebesgue measure and |.| for 
the Euclidean norm on R . Depending on context, |.| will also denote the 
cardinality of a set. As above, we denote the ball of radius r centred at x € IR^ 
by B x {r). For x = ( Xl ,...,x k ) £ R dk , let B x (r) = \J k =1 B Xi (r), fc(x) = 
h(xi, . . . , x k ) for h:R dk — > R and (fx = dx\ . . . dx k . Let 1 = (1, . . . , 1). We 
also use the standard Bachman-Landau notation for asymptotics 1 , and say 
that a sequence of events A n , n > 1 occurs with high probability (whp) if 
P {A n } — > 1 as n — > oo. 

1.2. A result sampler. We shall now describe, without (sometimes im- 
portant) precise technical conditions, some of our main results. Full details 
are given in the main body of the paper. We start with <£, a unit intensity, 
stationary point process on R d , and set 2 

T-n 1 ^ n l / d ' 

(l.i) $n = $n 

Let 

/? fe (C(<D n ,r)), /3 fe (i?($ n ,r)), 

respectively, denote the k-th. Betti numbers of the Cech and Vietoris-Rips 
complexes based on <3? n . 



1 That is, for sequences a n and b n of positive numbers, we write 

a n — o(b n ) <^=> for any c > there is a no such that a n < cb n for all n > no, 

On = 0(b„) <^=> there exists a c > and a no such that a n < cb„ for all n > no, 

a n — uj(b n ) for any c > there is a no such that a n > cb n for all n > no, 

a n = n(fc„) <=> there exists a c > and a no such that a n > cb n for all n > no, 

a„ = 6(6„) <^=> a n = 0(b n ) and a n = Q(6 n ) 



2 Note that our basic setup is a little different different to that of all the earlier papers 
mentioned above. To compare our results with existing ones on Poisson or iid point pro- 
cesses, note that rjj in our results typically corresponds to nr^ elsewhere. For a general 
(non-Poisson) point process, (1.1) provides a more natural setting. 
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In addition, let C k (& n , r) denote the set of Morse critical points of index 
k for the distance function 

dn{x) = rnin 11^ — -XII » 

and set 

N k ($ n ,r) = \{ceC k ($ n ,r):d n (c)<r}\. 

This paper is concerned with the behavior, as n — > oo, of f3 k (C($ n ,r n )), 
/3k(R($ n ,r n )), N k (® n ,r n ) and x(C($ n , r n )), where x denotes the Euler 
characteristic. In particular, we shall provide closed form expressions for 
the asymptotic, normalised, first moments of these variables, along with 
bounds for second moments for most of them. 

Throughout the remainder of this subsection we shall assume that is 
stationary, unit mean, and negatively associated (defined rigorously in Sec- 
tion 2.2). Additional side conditions may also need to hold, but we shall not 
state them here. Two simple examples for which everything works are pro- 
vided by the Ginibre point process and the simple perturbed lattice. Many 
of the results hold for various other sub-classes of point processes as well, 
but our non-specific blanket assumptions allow for ease of exposition. We 
divide the results into three classes, depending on the behavior of r n . 
I. Sparse regime: r n — > 0. Note that since the points of $ only gener- 
ate edges and faces of the Cech and Vietoris-Rips complexes C(<$> n ,r) and 
i?($ n , r) when they are distance less than r apart, and since <E> has, on, 
average, only one point per unit cube, if r is small we expect that both of 
these complexes will be made up primarily of the isolated points of We 
describe this fact by calling this the 'sparse' regime. 

In this setting, 

E{/3 (C($ n ,r n )} = E{/3 (-R($n,r„)} = 6(n) 

and for k > 1, there exist functions f k (r) = 1 or f k (r)— >0, as r — > 0, 
depending on the precise distribution of $ and on the index k, such that 

E{/3 fc (C($ n ,r n )} 
(1.2) E{/3 k (R(<f> n ,r n )} 
E{N k (Q n> r n )} 

and Var(iVfc(<I> n , r n )) = 0(E{N k (& n , r n )}), where Var(X) is the variance of 
X. In addition, E{n- 1 x(C($ n , r„))} -)■ 1. 



= e(nr^ +1 >/ fc+2 (r n )), 
= 9(nrW)/ 2fe + 2 (r n )), 
= 8(nr*/ fc+1 (r n )), 
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In the classical Poisson case, studied in the references given above, it is 
known that the same results hold with / = 1. 

Using stochastic ordering techniques, we shall also show that clustering 
of point processes increases the functions f k (r) and consequently the mean 
of the and N/. as well. Also, we know that for the Ginibre point process 
and for the zeroes of Gaussian entire functions, f k (r) = r k ( k ~ 1 \ Thus, there 
is a systematic difference between the scaling limits for Poisson and at least 
some negatively associated point processes. 

II. Thermodynamic regime: r d — > (3 e (0, oo). In this regime an edge 
between two points in which are, in a rough sense, an average distance 
of one unit apart, will be formed if they manage to get within a distance 

one another. Since, in most scenarios, there should be a reasonable 
probability of this happening, we expect to see quite a few edges and, in 
fact, simplices and homologies of all dimensions. Indeed, this is the case, 
and the main result in this regime is that topological complexity grows at a 
rate proportional to the number of points, in the sense that 

E{/3 k (C($ n ,r n )} = G(n), 

with identical results for E{/3fc(i?($ n , r n ))} and E{Afc(<E> n , r n )}. In addition, 
Var(JV fc ($ n ,r n )) = 0(E{N k (^ n , r n )}) and 

d 

k=l 

where the ffc($, (3) are defined in Theorem 5.2. Since there is no appearance 
in these results of an analogue to the / of (1.2), the normalisations here 
have the same orders as in the Poisson and iid cases. 

III. Connectivity regime: r% = G(logn). Clearly, if r n is large enough, 
there comes a point (which we call the contractibility radius) beyond which 
all the points of <I> n will connect with the others and both the Cech and 
Vietoris-Rips simplices will become trivial, in the sense that the graph over 
the points will be totally connected, and so the various simplicial complexes 
will be contractible to a single point. (This is certainly the case if r n = 
\fdn l l d .) The question, then, is "how large is large enough". 

It turns out that in the current scenario of negative association there exist 
case dependent constants C such that for, r n > C(logn) 1 ' d , C(& n , r n ) is con- 
tractible whp as n — > oo. In the specific cases of the Ginibre process or zeroes 
of Gaussian entire functions, this happens earlier, and r n = ©((log n) 1//4 ) is 
the radius for contractibility of the Cech complex. As a trivial corollary, it 
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follows that, whp, x(C($ n ,r n )) = 1 when r n is the radius of contractibility. 
Further, for the Ginibre process, r n = 0((log n) 1//4 ) is also the critical radius 
for ^-connectedness of the Vietoris-Rips complex. 

1.3. Some implications for topological data analysis. Perhaps the core 
tool of TDA is persistent homology, as visualised through barcodes and 
persistence diagrams (cf. [10, 13, 16, 35]). While here is not the place to go 
into the details of persistent homology, it can be described reasonably simply 
in the setting of this paper. For a given n, and a collection of points 
consider the collections of Cech (or Vietoris-Rips) complexes C(& n ,r) built 
over these points, as r grows. Initially, C(3> n , 0) will contain only the points of 
$ n . However, as r increases, different homological entities (cycles of differing 
degree) will appear and, eventually, disappear. If to each such phenomenon 
we assign an interval starting at the birth time and ending at the death 
time, then the collection of all of these intervals is a representation of the 
persistent homology generated by $ n and is known as its 'barcode'. The 
individual intervals are referred to as 'bars'. The Betti numbers /3k(C(<& n , r)) 
therefore count the number of bars related to A;-cycles active at 'connection 
distance' r. 

Heuristics and (unpublished) simulations indicate that bars of regular or 
negatively correlated point processes start later and vanish earlier than those 
for Poisson point process. Some of our results confirm this heuristic. For 
example, using the results above it is easy to see that non-trivial homology 
groups of Cech and Vietoris-Rips complexes start to appear once r n satisfies 
r d(k+i) jk+2^ r ^ _ w ( n — 1)_ p or the Poisson case this requires only r n = 
o;(?i~ 1 / d( ' fc+1 )). Since, typically, f(r) — > as r — > 0, we therefore generally 
need larger radii for non-trivial homology to appear. The disappearance 
of homology is harder, however, and in general our results on connectivity 
cannot confirm the heuristic. However, for the Ginibre point process and 
zeroes of GEF in M 2 , they do show that non-trivial topology vanishes at 
r n = w((log ro) 1 / 4 ) as opposed to a; ((log n) 1//2 ) for a two dimensional Poisson 
process. 

As for implications to TDA, applied topologists are beginning to appre- 
ciate the fact that stochasticity underlies their data as a consequence of 
sampling, and are beginning to build statistical models to allow parameter 
estimation and inference (e.g. [8, 11, 26, 34]). The results of this paper show 
that small changes in model structure (such as the introduction of attraction 
and repulsion between points in a data cloud) can have measurable effects 
on topological behaviour. 

The remainder of the paper is organised as follows: In the following sec- 
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tion, we shall summarise some facts needed from the theory of point pro- 
cesses. Sections 3, 4 and 5 contain the core technical results on component 
and subgraph counts, Betti numbers and Morse critical points respectively. 
We shall give careful proofs for all the results of Section 3. The results of the 
Sections 4 and 5 are either easy corollaries of earlier results or can be proved 
by using similar techniques, and so there we shall give less detail. Section 
6 contains a technical result regarding Palm void probabilities of Ginibre 
process which Manjunath Krishnapur proved for us. 

2. Point processes. Our aim in this section is to set up some general 
definitions related to point processes, give some background on those of 
main interest to us, and to prove two technical results, of some independent 
interest, which we shall need later. 

2.1. Point processes and Palm measures. A point process $ in M. d is a 
AA-valued random variable, where N is the space of locally finite (Radon) 
counting measures in W* 1 equipped with the canonical cr-algebra (cf. [20, 33]). 
We can represent $ as either a random measure, $(•) = X^^qO)) or as a 
random point set $ = {Xj}j>i, where, in both cases, the Xi are the 'points' 
of the process. 

The factorial moment measure of a point process <3? is defined by 

«^(n^) = E{n*(s*)}, 

for disjoint bounded Borel subsets B±, . . . ,B n . When k = 1, a := a*- 1 ) is 
called the intensity or mean measure, and also serves as the intensity 
measure of the point process 

*W := {(X 1 ,...,X k )£* k :X i ?X j , V i ^ j}. 

The k-th joint intensity, p^:(M. d ) k — > [0, oo) is the density (if it exists) of 
a k with respect to (in this paper) Lebesgue measure. The p^ k > characterize 
the point process just as moments characterize a random variable. Through- 
out, we shall restrict ourselves to simple stationary point processes of unit 
intensity; viz. a(B) = \B\ for all bounded, Borel B. Most of our results, 
however, will depend mainly on the behaviour of p k in the neighbourhood 
of the origin. 

For a point process whose probability distribution is P, its reduced 
Palm probability distribution ¥ x at x G R d is defined as the measure whose 
corresponding Palm expectation E^. satisfies the following Campbell- Mecke 
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formula (cf. [33, pll9]) for any bounded measurable function h : N x M. d — > 

R + : 

(2.3) e\Y,K*\{X},X)\ = I K{h($,x)}dx. 

Ue* J 

In particular, we assume that V x {<S>(x) = 0} = 1. Intuitively, P^, is the 
distribution of the remainder of the point process, conditioned on there 
having been a point at x. 

2.2. Some special cases. We shall assume the reader is familiar with sta- 
tionary Poisson point processes, determined, for example, by p k = 1 for all 
k, and use this as a basis for comparison in a quick tour through some non- 
Poisson cases that will provide examples for the theorems of the remaining 
sections. 

Associated point processes. A point process 3> is called associated (or posi- 
tively associated) if for any finite collection of disjoint bounded Borel subsets 
Bi, . . . , B/. C M rf and /, g continuous and increasing functions taking values 
in [0,1], 

(2.4) Cov . . . , *(B k )), . . . , HB k ))) > 0, 

cf. [9]. Examples of associated processes abound. We call a point process $ 
negatively associated if 

(2.5) Cov g($(B k+1 ),...MBi))) < 0, 

for any finite collection of bounded Borel subsets B±, . . . ,Bi C M. d such that 
(Bi U • • • U Bk) n (.Bfc_|_i U ■ ■ ■ U Bi) = and /, g increasing bounded continuous 
functions. 

As we shall see, examples of negatively associated processes are scarcer 
than their positively associated counterparts, a phenomenon that occurs 
even in simpler situations; cf. [28]. The stationary Poisson point process 
is both negatively and positively associated. Finite independent unions of 
negatively associated point processes is negatively associated as well and 
this can be used to construct many examples of negatively associated point 
processes from a few simple examples. Here are three others that will interest 
us: 
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Determinantal and permanental processes. A simple point process <3? on Mr 
is said to be determinantal with kernel K : (M^) 2 — > C if its joint intensities 
satisfy the following equality for all k > 1 and for all x±, . . . , x^ G M d : 

(2.6) p k (xi, ...,x k ) = det(K(xi,Xj)i< itj <k), 

where det indicates a determinant of a matrix. 

Stationary determinantal point processes are negatively associated ([23]). 
For examples of stationary determinantal point processes, see [21, Section 5]. 
A determinantal process of particular interest is the unit intensity Ginibre 
process ([2, Sec 4.3.7]) which has kernel 

K(z, w) = exp(— -(||z|| 2 + ||il>|| 2 ) + zw), z,w G C. 

For permanental point processes, the determinant in (2.6) is replaced by 
a matrix permanent. 

Perturbed lattices. Let N z : z G Z d be iid integer valued random variables 
distributed as N, and Xi z , i > 1, z G Z d be iid M. d valued random variables 
distributed as X. A perturbed lattice is defined as 

$(N,X) := |J + 

provided that <&(N,X) is a simple point process. is called the replication 
kernel and X is called the perturbation kernel. The point process for which 
N = 1 and X is uniform on the unit cube is known as the simple perturbed 
lattice and is negatively associated. For more details see below and especially 
[6]. 

Zeroes of a Gaussian entire function. (Normalized) Gaussian entire func- 
tions are defined on the complex plane C via the a.s. convergent expansion 
f( z ) = S^=o £>nZ n /\rd-i where the £ n are iid standard complex Gaussians. 
The zeros of / (when considered as a point process in M? and called as 
zeros of GEF) while neither negatively associated nor determinantal, share 
many properties with the Ginibre process that make them interesting and 
tractable; cf. [2] for more background. 

Sub- and Super-Poisson processes. At times, weaker notions than associa- 
tion, based only on factorial moment measures, suffice to establish interest- 
ing results. 
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We say that a point process is a-weaker than <3>2 (written $1 < a - w $2) 
if a[ k) (B) < a ( 2 k) {B) for all k > 1 and bounded Borel B C (M^. We call 
a point process a-negatively associated (associated) if a^ k+l \Bi x .62) < (> 
)a( fc ) {B l )a^{B 2 ) for all fc, Z > 1 and bounded Borel B 1 xB 2 C (R d ) k x(R d ) 1 . 

Negative association (association) implies a-negative association (associ- 
ation) which in turn implies a-weaker ordering with respect to the Poisson 
process with intensity measure a. 

Even weaker notions of association come from looking at void probabil- 
ities, and we say that a point process $1 is v-weaker than $2 (denoted by 
$1 <v-w $2) if 

ut(B) =P{$ 1 (B) = 0} < P{$ 2 (B) = 0} = u 2 {B) 

for all .B bounded Borel subsets. 

Finally, we call a point process a-sub-Poisson (super-Poisson) if it is 
a-weaker (stronger) than the Poisson point process and similarly for v- 
sub-Poisson (super-Poisson). A point process is weakly sub-Poisson (super- 
Poisson) if it is both a- and z^-sub- Poisson (super-Poisson). 

It is known that negative association (association) implies the weak sub- 
Poisson (super-Poisson) property. Other examples come from perturbed lat- 
tices. For example, if the replication kernel N is hyper geometric or binomial, 
then the resulting point process is a weakly sub-Poisson point process. On 
the other hand, negative binomial and geometric perturbation kernels lead 
to weakly super-Poisson processes. Permanental point processes are also 
weakly sub-Poisson. See [6] for proofs and more about stochastic ordering 
of point processes. 

2.3. Two technical lemmas. We shall state some general results about 
Palm measures of these point processes that we need later. The first lemma 
shows that negatively associated point processes are "stochastically stronger" 
than their Palm versions. This can also be viewed as a justification for the 
usage of negative association as the defining property of sparse point pro- 
cesses. The second shows that Palm versions of negatively associated point 
processes also exhibit negative association. We state the results in more 
generality than we need, since they seem to be of independent interest. 

Lemma 2.1. Let & be a negatively associated stationary point process in 
M. d of unit intensity and F:M. dn — > M + an increasing bounded continuous 
function. Then for B%, . . . ,B n disjoint bounded Borel subsets and almost 
every x £ R dk , 

(2.7) E l Xl ^ Xk (F(<f>(Bi), $(B n ))) < E{F(<I>(fli),..., *(£„))}. 
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The above inequality will be reversed for an associated point process. 
PROOF. For < e < r we have 

E{F($(Bi \ B x (r)), . . . , $(B n \ Bx(r))) | $(B Xi (e)) > 1, 1 < i < k} 

e{f(^(B 1 \ S x (r)), . . . , <&(B n \ B x (r))) nti W^Je)) > 1]} 

P{#(B^)) > 1,1 < i < k} 
E{F(d>( J B 1 \ £ x (r)), . . . , $(B n \ B x (r)))} E{n* =1 W^e)) > 1]} 

= E{F($(Bi \ B x (r)), • • • , *(B„ \ Bx(r)))} , 

where the inequality is due to the negative association property of 

Sending first e — ?• and then r — >• 0, (2.7) follows immediately from [32, 
Lemma 6.3] and monotone convergence. □ 

Lemma 2.2. Let & be a negatively associated stationary point process 
in R d of unit intensity. Let F:R dn -)• R + and G:R dm ->■ M+ 6e increas- 
ing bounded continuous functions. Then for B\, . . . , B m+n disjoint bounded 
Borel subsets and almost every x £ R d ( k+l ) ; 

E x { J F(<D(i? 1 ), . . . , $(£?„)) G(<&(B n+1 ), . . . , $(B m+n ))} p( fc+ ') (x) 
< E^JW),...,*^))} 

xEL fc+1 ,..., :Cfc+ ,{G($(B n+1 ) ) . . . , $(B m +n))} 
xp (fc) (xi, . . . ,x k )p^(x k+1 , . . .,x k+ i). 

The above inequality will be reversed for an associated point process <£. 

Proof. As in the proof of Lemma 2.1, take < e < r. For notational 
simplicity, set B* = B \ -B x (r) for bounded Borel set B. 

E{F($( J B 1 *), • • • , *(K)) G($(B* n+l ), HB* m+n )) 

| HB Xi (e)) > 1, l<i<(fc + Z)} 

xP{*(B Xl (e))>l, l<t< (* + *)} 
fe 

= E{F($( J B 1 *), . . . , $(5;)) JJ l[$(B Si (e)) > 1] 

i=i 
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I 

xG(^B* n+1 ),...,^B* m+n ))l[l[<f>(B Xk+i (e)) > 1]} 

i=l 

e{f(*(bd, . . . , <B(i%)) Ui=i > i]} 
- F{$(B Xi {e)) > 1,1 <i < k} 

E{G($(^ +1 ),...,$(^ +w ))nLil[^ fc+i (e)) > 1]} 
X F{$(B Xk+i (e))> 1,1 <i<l} 

xP {<f>(B Xi (e)) > 1, 1 < i < fc}P {*(B* fc+i (e)) > 1, 1 < i < Z} 

= E{F(*(flJ), •••,*(£»)) | <K^(e))>U 

xE{G($(B* n+1 ), . . . , *(5^ +n )) | $(B Xk+i (e)) > 1, 1 < i < l} 
xF{<S>(B x .(e)) > 1,1 <i < k} P {$(Bx k+t (e)) > 1, 1 < * < , 

where the inequality is due to the negative association of <3>. As in the previ- 
ous proof, the conditional expectations in the first and last expressions con- 
verge to the respective Palm expectations as e — > 0. Since <1> is a simple point 
process, after dividing by \Bo(e)\ k+l on both sides, the product of the prob- 
ability terms in the last line converges to p^ k >{x\, . . . , Xk)p^ (xk+i, ■ ■ ■ , Xk+i) 
while the probability term in the first line converges to p( fc+, )(x) as e -> 0. 
Complete the proof by sending r — > 0. □ 

3. Subgraph and component counts in random geometric graphs. 

Recall that for a point set $ and radius r > 0, the geometric graph r) 
is defined as the graph with vertex set $ and edge-set {(X, Y):\\X — Y\\ < 
r}. We shall work with restrictions of $ to a sequence of increasing win- 
dows W n = [ =J ^ — , 1L 2~] d , along with a radius regime {r n > 0}„>i, setting 
$ n := $nW n . The choice of the radius regime will impact on the asymptotic 
properties of the geometric graph when the points of are those of a point 
process. 

Let r be a connected graph on k vertices. In this section we shall be 
interested in how often T appears (up to graph isomorphisms) in a sequence 
of geometric graphs G n = G(Q n , r n ), and how often among such appearances 
it is actually isomorphic to a component of G n , viz. it is a T-component of 
G n . For graphs built over Poisson and iid processes, we know from [30, 
Chapters 3,13] that no T-components exist when n(r^) fc_1 — > (|T| = k), 
but that they do appear when n(r^) fc ~ 1 — > oo. The T-components continue 
to exist even when = o(logn) and vanish when r d = w(logn), which is 
the threshold for connectivity of the graph. 

In this section, we shall show, among other things, that the threshold for 
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formation of r-components for negatively associated processes with r n — > 
is n.(r^) fc_1 / fe (r n ) — > oo, for functions f k which typically satisfy f k (r) — > 
as r — > 0, and so is higher than in the Poisson case. These components 
continue to exist even when r d — > /3 > 0. The threshold for the vanishing of 
components will be treated in the next section. 

The reader should try to keep this broader picture in mind as she wades 
through the various limits of this section. 

3.1. Some notation and a start. As above, let T be a connected graph on 
k vertices, k > 1, and {x\, . . . ,Xk} a collection of k points in M d . Introduce 
the (indicator) function hr'M. dk x R + — > {0,1} by 

(3.8) fc r (x,r) := l[G({xi, . . . ,x k },r) ~ T], 

where ~ denotes graph isomorphism and 1 is the usual indicator function. 
For a fixed sequence {r n } set 

(3.9) /ir,n(x) := /i r (x,r n ), 
and, for r = 1, write 

(3.10) h r (x) := h r (x,l) 

Moving now to the random setting, in which $ is a simple point process 
with £>th intensities , we say that r is a feasible subgraph of <!> if 

I /i r (x)p (fc) (x)(ix > 0. 

J(R d ) k 

Thus, r is a feasible subgraph of $ if the measure of finding a copy of it 
(up to graph homomorphism) in G($, 1) is positive. In most cases, feasibility 
will hold because p( k \x) > a.e. or at least on a large enough set. 

We shall be interested in the the number of T-subgraphs, G n (&,T), and 
number of r-components, J n ($,T), of which are defined as follows 3 : 

(3.11) G n ($,r) := i Yl h r,n( x ) 

3 In the terminology of [30], Gn(4 ) ,r) denotes the number of induced F— subgraphs of 
G($, r) and not the number of subgraphs of G($, r) isomorphic to Y. However, it is easy 
to see that the latter is a finite linear combination of the number of induced subgraphs of 
the same order. 
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Jn(*,r) 



1 



£ hr,flWl[$n(BxW) = fe]. 



fc! 



Note that J n considers graphs based on vertices in $> n only, viz. all vertices 
that lie in W n . Such a graph, however, may have vertices within distance r n of 
a point in the complement of W n , and so actually be part of something larger. 
To account for this boundary effect, we introduce an additional variable, 
which does not count such "boundary crossing" graphs. This is given by 



We shall see later that in the sparse and thermodynamic regimes the dif- 
ferences between J n and J n disappear in asymptotic results. Nevertheless, 
both are needed for the proofs. 

The key ingredient in obtaining asymptotics for sub-graph counts and 
component counts are the following closed-form expressions, which are im- 
mediate consequences of the Campbell-Mecke formula. 



Much of the remainder of this section is based on obtaining asymptotic 
expressions for these integrals in terms of basic point process parameters 
in the sparse and thermodynamic regimes, as well as looking at bounds on 
variances. We shall consider the connectivity regime only in the following 
section on Betti numbers. Our results here extend those of [30, Chapter 3] 
for Poisson and iid processes, and the general approach of the proofs is thus 
similar. 

3.2. Sparse regime: v n — y 0. The intuition behind the following theorem 
is that in the sparse regime it is difficult to find T-subgraphs in a random 
geometric graph, and even more unlikely that any such subgraph will have 
another point of the point process near it. This implies that any such sub- 
graph will actually be a component of the full graph, disconnected from 
other components. 

Theorem 3.1. Let & be a stationary point process in M rf of unit intensity 
and r be a feasible connected graph of & on k vertices. Let be almost 



(3.12) 




n 
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everywhere continuous. Assume that p( k \0, . . . , 0) = ; and that there exist 
fM + -»• R + and g k : 



functions f k 'M + ->■ M.+ and g k :(B (k)) k —?■ M. + such that 



P {k) (ry) = e(f k (r)) and lim = g k p (y ) , 

/or a// y o/ the form y = (0, y2> • • • > 2/fc)- Further, assume that f k+l = 0(f k ) 
as r — > and is almost everywhere continuous. Let r n — > 0. Then, 

(3 15) lim E{G " (<I) ' r)} = lim E W*> r » 

= Mo(*,r) 

fi fc = i, 

Ih/k^-i) /»r(y)^J(y)rfy fc > l. 

If p( k \0, . . . ,0) > 0, £/ien i/ie same result holds with f k = 1 and g^ = 
p«(0,...,0). 

Before turning to the proof of the theorem we shall make a few points 
about its conditions, and provide some examples. Throughout we assume 
that all point processes are normalized to have unit intensity. 
1: Note that the theorem does not guarantee the positivity of po(§,T). 
2: / 1 (r) = 1 for all stationary point processes of unit intensity since, in this 
case, p^ = 1. 

3: It is easy to check that if $ is a-negatively associated or a-super-Poisson, 
then the condition f k+l = 0(f k ) as r — > is satisfied. 

4: In the case p k (0, . . . , 0) =0 for k > 2, even if we cannot find appropriate 
f k or g k , it is still true that E{G n ($,r)} = o(nri {k ~ 1] ). 
5: If is only Z d -stationary (as is the case with perturbed lattices), then it 
will be clear from the proof that (3.16) still holds, but with 

/"o($,r):=-/ / h T (x,y)g k Jx,y)dxdy. 
k - J[o,i] d Js. d ( k - 1 ') 

6: For a homogeneous Poisson point process, the theorem holds with f k = l 
and g k = 1, recovering [30, Prop. 3.1]. 

7: If $ > Q _„, $ ( i), then for all k > 1, p( k ) > pjjj = 1 and hence f k = 1 
and also po(&,T) > 0. Examples of point processes in this class are all 
super-Poisson perturbed lattices and permanental point processes. 
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8: For a perturbed lattice <I> with perturbation kernel N € {0, . . . ,K} a.s., 
pW(0,...,0) > iff k < K. In this case, jUo($,r) > for a connected 
graph r on k vertices. For connected graphs r on k vertices with k > K, 
nr n d ^ k 1 ^E{G? n ( < I ) , r)} — > 0. For sub-Poisson perturbed lattices, the exis- 
tence of f k depends on the perturbation kernel. However, for high values of 
k, it is clear that the scaling for sub-Poisson perturbed lattices will differ 
significantly from that of the Poisson case. 

9: From [27, Theorem 1.1] for the zeroes of Gaussian entire function and 
calculations similar to [2, Theorem 4.3.10] for the Ginibre point process, one 
can check that in both cases 

p k (xi, ...,x k ) = 6( JJ \\xi - Xj\\ 2 

i<j 

Hence, f k (r) = 0(r fc ( fc_1 )) for these processes. 

Proof. We shall prove the theorem for k > 2. The case k = 1 follows 
easily by making a few notational changes to the general case. We start 
with the convergence of E{G7 n (<I>, T)}. In the expression for E{G7 n (<I>, T)} in 
(3.13), make the change of variable Xi = x\ + r n yi for i > 2 and then use 
stationarity of the point process to obtain 

d(k—l) 

E{G n ($,r)} = / / fcr,n(xl + r B y) 

k\ JW n J{r- 1 {W n -x))^ 

xp { - k \xl + r n y)dx... dy k 

d(k—l) r. 

(3.16) = / / h r>n (r n y)p^(r n y)dx...dy k 

d(k— 1) p 
^ rjL T^~ / / h T (y)p^(r n y)dx...dy k 

>Wn JK. d ( k - 1 1 



nr. 



k\ 

d(k-l) 



hr(y)p {k) (r n y)dy. 

(fe-i) 



Since T is a connected graph, hr = outside (Bo(k)) and hence the 
preceding integral is finite. Further for all x G W i , it follows that 

(n <T — 2k) d 

Bo{k) C (W n -x)Cr- 1 (W n -x) 
for large n. Hence, for large enough n, 

d(k — 1) r. r. 

E{G n (3?,r)} > T -2—- / / h r (y)p {k) (r n y)dx... dy k 

k\ J W i J(r^ 1 {W n -x)) k - 1 

(n3 -2k) d 
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I n 



\ h v {y)p {k) (r n y)dx... dy k 

W i JR^- 1 ) 

( n 3 -2k) d 



{{n\ - 2k) d )r d n {k - l) 



k\ 

Thus, as n — > oo, 

E{G n ($,r)} i 



hr(y) P (k) (r n y)dy. 

(fe-i) 



hr(y)p (k) (r n y)dy. 



nr. 

Note that we can restrict the range of integration in the above equation to 
Bo(k). Since p^ k \r n y)/ f k (r n ) = g k (y) a.e. in Bo(k) and g k is bounded (as 
it is continuous) in Bo(k), we can use the Lebesgue dominated convergence 
theorem to show that, as n — > oo, 



This proves the convergence of expected number of T-subgraphs. 

We shall now show that the normalised expected numbers of components 
and subgraphs are asymptotically equivalent for small enough radii. This 
will complete the proof of the theorem. 

Using the lower bound of 1 — &(Bx(r n )) for the void term in J n (see 
(3.11)) we obtain the following lower bound for J n . 

Jn(*,r) >G n ($,r)-i Yl h r ,n( x MBx(r n )) = G n ($,T)-E n ($,r). 



Since J n < G n , we only need to show that d(k-i) \ ~~ ^ 0- From the 
Campbell-Mecke formula, we have 

E{E n (<S>,T)} = i f / lr ,n(x)EU^(i?x(rn))}p (fe) (x)dx. 
fc! J W k 

From [32, Lemma 6.4], we know that p'^\y) = ~(kj^f^- Now applying the 
Campbell-Mecke formula for in the above equation, we find that 

E{£ n (<D,r)} = i / h r ,n(x) P {k+1 \x,y)dxdy. 

K - Jw%xB x (r n ) 
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Now apply the change of variables Xi = x\ + r n yi for i > 2, y = r n y and 
proceed as in the case of K{G n }, to see that, for large enough n, 

E{E n ($,T)} < ^ / hr(y)p( k+1 Hr n y,r n y)dydy, 

where the additional factor of is due to the y variable. Dividing by 
nr n j (r n ) and bounding hr by 1, we have 

E{E n (<S>,T)} r d J k+l {r n ) f P (k+l) (r n y,r n y) 



d n {k - 1] f k {r n )~ k\f k {r n ) J Bo[k)k -i xBo{k+1) f k+1 (r n ) 



nr n 

Since f k+1 (r) = 0(f k (r)) by assumption, ^frrrr^ * an d hence 

nr n J (r n ) 

as required. □ 

The following corollary follows easily from the ordering of the joint inten- 
sities of the point processes. 

Corollary 3.2. Ze£ <3?j, i = 1,2, 5e too stationary point processes and 
/p., 3p. correspond to the functions of Theorem 3.1. If ^\ < a -w &2> then 

fp- L < fp 2 - tf f k Pl = fp 2 > then 9 k Pl < 9 k P2 , and hence Mo($l,r) < /x ($2, r) /or 
a connected graph V that is feasible for both <J>i and $2- 

3.3. Thermodynamic regime: r d — > /3. 

Theorem 3.3. Let & be a stationary point process in M. d of unit intensity 
and r be a feasible connected graph of 3> on k vertices. Assume that zs 
almost everywhere continuous, and let r d — > (5 > and y = (0, y2, ■ ■ ■ ,yk)- 
Then, 

hm = (ip{®,T) 

n— too n 



(3.17) 



1 k = 1, 

^/ R d {h -i)/»r(y)pW(i93y)dy ^>2, 



n— >oo n 
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fp , o {$(B o (^))=0} k = l, 

(3.18) := l^f Rd(k _ 1} h T (y)pW(^y) 

xP ! t {$(5 1 (^3)) = o) dy £;> 2. 

// $ is a negatively associated point process with P {$(i3 x (/33)) = o| > 
for almost every x € i?o(/3 3 i/ien 7 / g($,r) > 0. 

Again, before turning to the proof, we make some observations about the 
theorem: 

1: The positivity of 7^($,T) is not immediate. For an example in which this 
does not hold, let $o be a Poisson point process of unit intensity in M. d , 
i > 1 iid copies of the point process of 4 iid uniformly distributed points 
in i?o(/3 3 /2), and define the Cox point process, 

<£> := U {*i + $ i>- 

Clearly, for all Ie$,P {$(£ x (^) > 4} = 1. 

Now take = (3 and T a triangle, and note that J n (<3?,r) = for all 
n > 1 and so 7«($, r) = 0, even though all the assumptions of Theorem 3.3 
are satisfied. 

2: As in Corollary 3.2, <3?i < a - w $2 implies that /x<j(<i>i,r) < /i / g(<I> 2 , T). 
However, as the previous example shows, the situation for 7 | g($,T) is some- 
what more complicated. 

3: If |r| = 1, then J n ($,r) is the number of isolated nodes in the Boolean 
model of balls of radii f3 centred on the points of <3? The Palm measure of a 
determinantal point process is also determinantal and in particular, for the 
Ginibre process, p^ l \z) = 1 — e - " 2 " . Using this explicit structure, it can 
be shown that, for small enough /?, 

-yp(*Gin,T) > l-7r/3 2 + 7r(l-e- /32 ) > 1 - vr/5 2 + 0(vr 2 /3 4 ) = 7/3 ($ Po4 ,r), 

and hence the inequality for the 7/3 could be reversed in the thermodynamic 
regime for even negatively associated point processes as compared to the 
sparse regime. 

Proof. Since the proof here is similar to the preceding one, we shall 
not give all the details and, again, we shall only bother with the case k > 
2. Starting with (3.13) and (3.14), the proof follows similar lines to that 
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of Theorem 3.1. The difference is that r n 1 — > (3 k 1 and p^ k \r n y) — > 
p( fc )(/3dy) and so there is no need for additional scaling. For the convergence 
of J n , one first shows the convergence of J n using similar techniques to those 
in the proof of Theorem 3.1. Then, note that 

Jn($,r) < j n (<&,r) < j n ($,r) + G n ($ n /$ (ji i_ (jfc+iK)d ,r). 

The rightmost term in the upper bound accounts for the boundary effects, 
and by arguments similar to those in the proof of Theorem 3.1, it is easy to 
see that 

E^^^^T)] = 0(\W n /W {nUk+i)rn)d \) = O(n^). 

More importantly for us, this expectation is o(n), and so of lower order than 
J n (3>, T). Thus E{ J„,($, T)} jn also converges to 7 ( g(<&, T). Since r% ->■ (3 > 0, 
the void probability term in J n is not necessarily degenerate. 

The positivity of 7 ( g($,T) for negatively associated point processes is an 
easy corollary of Lemma 2.1. We need only note that 

= = 0] = (1-$(B))V0 

is a decreasing bounded continuous function and hence 

F^(B x (^)) = 0) > P{$( J B X (/33)) = 0| > 

for a.e. x € Bo(k) k . This completes the proof. □ 

3.4. Variance bounds for the sparse and thermodynamic regimes. The 
crux of the second moment bounds lies in the fact that, up to constants, 
variances are essentially bounded above (below) by expectations for neg- 
atively associated (associated) point processes. (It is simple to check that 
Var(.) = G(E{.}) for the Poisson process, which is both negatively associ- 
ated and associated (cf. [30, Chapter 3]).) We, however, shall need to extend 
these inequalities to graph variables, and this is the content of this section. 

Theorem 3.4 (Covariance bounds in sparse regime). Let T and To be 
two feasible connected graphs on k and I (k > I > 2) vertices, respectively, 
for a stationary point process $ with almost everywhere continuous joint 
densities. Let $ satisfy the assumptions of Theorem 3.1 and assume that the 
f 3 and gj, exist for all j < k + 1. Further, let r n — > and Ho(&, T) > 0. 
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l.If<& is a-negatively associated, then 

cov (g„($, r), G n (*, r )) = o(E{G n ($, r)}). 

If & is a-associated, then 

Cov(G n ($,r),G n ($,r )) = n(E{G ri ($,r)}). 

Proof. We shall prove the result for a-negatively associated processes 
and k > 2. The a-associated case follows by reversing the inequality sign in 
(3.19) below and the case k = 1 needs a few simple notational changes. We 
shall again use the Campbell-Mecke formula to obtain closed-form expres- 
sions for the second moments and then perform a similar analysis as in the 
proof of Theorem 3.1 to obtain the asymptotics. 

For j < I and x = (x±, . . . ,Xk+i-j), in analogy to (3.8)-(3.10), define 

^-r,r j( x ) : = hr(xi,...,xk)hr (xi,...,Xj,Xk+i,...,x k +i-j), 
ro j>( x ) :— hr, n {xi, ■ ■ ■ , Xk)hr , n {xi, ■ ■ ■ , Xj, x^+i, ■ ■ ■ , x^+i-j)- 

Then 

(3.19) E{G\($,r)G n ($,r )} 
= E< 



h r , n (x)h ro , n (y) 

{x,yc$,\x\=h,\y\=i 



i 



j=o [x,yc$,\x\=k,\y\=i,\xny\=j 



1 



^r,ro,o,n( x )P (fe) ( x i' • • • 



xp w (x fc+ i,...,x fc+i )dxi... cte A 



+E{G n ($,r)}E{G n ($,r )}, 
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where the inequality is due to the a-negative association property. Thus 
using similar arguments as in the proof of Theorem 3.1 and setting y = 
(0,2/2 • • .,yk+i-j), we have 

Cov(G n ($,r),G„($,r )) 

~ E "^"^ry / "r,r„,(y) 9 r- fl (y) <iy 

~ 3\k - - 3)1 J K d(k+i- 3 -i) p 

= 0{nr d n ^f k {r n )) 
= 0(E{G n (cD,r)}), 

which is what we needed to show. □ 

Unlike the sparse regime, subgraph counts and component counts have 
different limits in the thermodynamic regime and hence we need variance 
bounds on component counts in the thermodynamic regime. 

Theorem 3.5 (Variance bounds in the thermodynamic regime). Let <1> 
be a stationary point process in ~R. d of unit intensity and T be a feasible 
connected graph of $ on k vertices. Assume that is almost everywhere 
continuous. Let r d — > (3 > and 7 i g( < l ) , T) > 0. 



1. If is negatively associated, then Var(^ </„($, T)j = 0(E{ J n ($, T)}). 

2. //$ is associated, then Var^J n (<I>, V) \ = 0(E{ J n ($, r)}). 

Proof. Firstly, write 
J n ($,r) 2 = J n ($,r)+ h T>n {X)hr,n{Y) 

X,Yc$ n ,\X\=\Y\=k 

xl[*(S x (r„)) = $(Sy(r n )) = 0]. 

By the Campbell-Mecke formula, 

E{j n ($,r) 2 } = E\j n (<S>,r)} + -±- [ ^r,n(x)/ir,n(y) 
I J I J (k\y Jw^xw^ 

xl[G({x, y}; r n ) is disconnected] 
xPj Ciy {f(5 x (r n )) = $(B y (r n )) = 0} /3 ( 2fc )(x, y )dxdy. 
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Thus, 

Var(j n ($,r)) = E<jj n ($,r)} 



+77779 / /ir,n(x)/i r ,n(y)Qn(x,y)dxdy, 



where 

Q n (x,y) := l[G7({x, y}; r n ) is disconnected] 

xP^ y {$(5 x (r n )) = <&(fl y (r n )) = 0} 
x /3 ( 2fc )(x,y)-P^(S x (r n )) = 0} 
xP y {$(B y (r ri )) = 0}pW(x)p( fe )( y ). 

Choose n large enough so that r n < /33 + \. For such an n and negatively 
associated <£, we know from Lemma 2.2 that Q„,(x, y) < for all x, y such 
that the set distance ds(x,y) := infjj \\xi — yj\\ > 3/33. Thus, we have that 

Var(j n (d>,r)) < E{j re ($,r)} + -^ I h T>n (x)h T , n (y) 

xQ n (x,y)l[d(x,y) < 3/3d]dxdy. 

From Theorem 3.3, we know that E|j n ($,r)| = O(n) and using similar 
methods as in that theorem, one can show that the latter term in the above 
equation is of 0(n). Combining the two, we get that Var^ J n (3>, T)j = 0(n). 

The proof for associated point processes is same as above except that 
Q n (x, y) > for all x, y such that dg(x, y) > 3/3^, giving that in this case 

Var( J re ($,r)J = S7(n), as required. □ 



3.5. Phase transitions in the sparse and thermodynamic regimes. So far, 
we have concentrated on the asymptotic behaviour of the expectations of the 
numbers of different types of subgraphs that appear in the random graph 
associated with a point process. In this section we shall combine expecta- 
tions on first and second moment to obtain results about these numbers 
themselves, looking at probabilities that they are non-zero, as well as L2 
and almost sure results about growth and decay rates. The main theorem 
of this section is the following: 
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Theorem 3.6. Let & be a stationary point process with almost every- 
where continuous joint densities and T a feasible connected graph for 3> on 
k vertices. 

1: Let $ satisfy the assumptions of Theorem 3.1 with /Uo($, T) > 0. Let 

TWO. 

(a) Ifnr$ k - 1) f k (r n ) -> 4 , then F {G n ($, T) > 1} -> 0. 

^ ijf $ is a-negatively associated and nrf[ k ^ f k {r n ) — > j3 for some < 

P < oo, i/iere £/iere exists a finite C (dependent on the process but not 

onT) for which 



lim P{J n ($,r) > 1} > 



1 



C 



fc,) If & is a-negatively associated and nr^ k ^ f k {r n ) — > oo, i/jen 



Jn($,r) 



d(fc-l) ffc 



f k (Tn) 



% Mo($,r). 



iei & be a negatively associated point process satisfying the assumptions 
of Theorem 3.3 with jp($,r) > 0. Let r% -> /3. T/ien, 



n 



Proof. The proof for Part 1(a) follows from Markov's inequality and 
Theorem 3.1. The proof of 1(b) is based on the following second moment 
bound: 



'{j n ($,r) > 1} > 



> 



> 



E{j n ($,r) 2 } 
(E{j n ($,r)}) 2 

E{G n (cD,r)2} 

(E{G n (<D,r)}) 2 , Var(G n ($,T)) 



+ 



(E{j„(cD,r)})2 (E{j„($,r)})2 



Now, by applying Theorem 3.4, we obtain that there exists a C > for 
which 



'{J„($,r) > i} > 



+ 



(E{G n (d),r)})- 

(E{j n (d>,r)}) 2 1 E{G„($,r)} 



4 Note that neither this assumption nor the one in 1(b) can hold for k = 1, as / (r) = 1. 
Hence the statements do not say anything in these two cases. 
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Under the assumptions of 1(b), E{G n ($,r)} converges to /3/xo($,r), while 
the first term in the square brackets converges to 1 by Theorem 3.1. 
For 1(c), observe that 

Var( J n ($, T)) < Var(G n ($, r))+2E{G n ($, T)} E{E n ($, r)}-(E{E n ($, F)}) 2 

where E n (<&,T) is as defined in the proof of Theorem 3.1. From the proofs 
of Theorem 3.1 and Theorem 3.4, it follows that 

Var(J n ($,r)) = 0(Var(G n (*,r))) = 0(nr^f k (r n )), 

which completes the proof for this case. 

We now prove Part 2 in a similar fashion. In fact, it follows easily from 
Theorem 3.5 and the relation between J n and J n noted in the proof of 
Theorem 3.3. More specifically, as n — > oo, 



J? 



Var 

and 

E [ ||^.r)-i„(#.r)|| l _ 'IWfW^ : „ 

n n 

Thus, we have that 4 7 ^($, T) and 11 M*,r)-M*,?)\\ 4 as n Xm 

□ 

Since J n is a A'^-Lipschitz functional of counting measures for a constant 
Kd depending only on the dimension d (see [30, Proof of Theorem 3.15]), the 
result in (2) above can be strengthened to a.s. convergence for stationary 
determinantal point processes by using the concentration inequality in [29, 
Theorem 3.6]. We leave the proof, which is not hard, to the reader. 

Theorem 3.7. Let $ be a stationary determinantal point process of unit 
intensity and V a feasible connected graph for ^ on k vertices. Let — > (3 > 
0. Then, for jp($>,T) as defined in (3.18), 

y 7j8(*}Ij, as n — >■ oo. 

n 
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4. Betti numbers of random geometric complexes. This is really 
the main section of the paper, giving, as it does, results about the homology 
of random geometric complexes through their Betti numbers. Despite this, 
it will turn out that, as mentioned earlier, the hard work for the proofs has 
already been done in the previous section. 

We shall start with a review of the basic topological notions needed to 
formulate our results, along with an explanation of the connections between 
Betti numbers of random complexes and component numbers of random 
geometric graphs. This connection was established and exploited in [18, 19] 
to extract theorems for Betti numbers from those for the component counts 
of random geometric graphs. 

4.1. Topological preliminaries. Recall that Cech and Vietoris-Rips com- 
plexes and their faces were already defined at Definitions 1.1 and 1.2 in the 
Introduction, and that the dimension of a face a is |<r| — 1. Recall also that 
the edges of the random geometric graph G(&, r) are the 1-dimensional faces 
of C($,r) or R($,r). 

Now, however, we require some additional terminology. 

The Vietoris-Rips complex i?(<&, r) is also called the clique complex (or 
flag complex) of G(<&, r), as the faces are cliques (complete subgraphs) of the 
1— dimensional faces. Let Hk(C($, r)) and Hk(R(<$>, r)), respectively, denote 
the k-th simplicial homology groups of the random Cech and Vietoris-Rips 
complexes. (We shall take our homologies over the field Z2, but this will not 
be important.) In this section we shall be concerned with asymptotics for 
the Betti numbers (3k(C($ n ,r)) and fik{R{§ n , r)), (i.e. the ranks of the ho- 
mologies) and through them the appearance and disappearance of homology 
groups. 

Next, let Pj. be the (/c + l)-dimensional cross-polytope in containing 
the origin, and defined to be the convex hull of the 2fc+2 points {±ej}, where 
ei, . . . , efc_|_i are the standard basis vectors of IR fc+1 . The boundary of 
which we denote by Ok, is a fc-dimensional simplicial complex, homeotopic 
to a A;-dimensional sphere. In terms of simplicial homology, the existence of 
subsimplices isomorphic to Ok is the key to understanding /c-cycles and so 
the k-th. homology. In fact, from [17, Lemma 5.3] we know that, because of 
the distance relationships between the vertices of a Vietoris-Rips complex, 
any non-trivial element of the /c-dimensional homology Hk(R(&,r)) arises 
from a subcomplex on at least 2k + 2 vertices. If it has only 2k + 2 vertices, 
then it will be isomorphic to Ok- 

Now let r{ , j = 1, . . . , rik (n^ < 00) be an ordering of the different graphs 
that arise when extending a (k + l)-clique (i.e. a /c-dimensional face) to a 
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minimal connected subgraph on 2k + 3 vertices. Thus, the T J k are all graphs 
on 2k + 3 vertices, having ( fc ^ 1 ) + k + 2 edges. 

Finally, for a given finite graph T, let G($, V) denote the number of sub- 
graphs of G($,r) that are isomorphic to T. However, as explained in 3 , 
G(& n ,T) is a finite linear combination of G n (&,T )'s with T 's being of the 
same order as T. 

Then [17, Lemma 5.3] and a dimension bound in [18, (3.1)]) imply the fol- 
lowing crucial inequality linking Betti numbers to component and subgraph 
counts in Vietoris-Rips complexes for k > 1; 

(4.20) J($, O fc ) < r)) < J($ , O fc ) + ]T r J fc2fc+3 ). 

j'=i 

A related inequality holds for Cech complexes. Let Tfc be the graph on k 
vertices such that any k — 1 vertices form a (k — l)-clique but is not a 
/c-clique. Any collection of vertices X for which G(X, r) ~ is said to form 
an empty (k — l)-simplex. Let T fc be the graph of a (/c — l)-clique with an 
extra edge attached to two vertices and T k be the graph of a (k — l)-clique 
with a path of length 2 attached to one of the vertices. Both T k and T k are 
graphs of order k + 1. The we have the following combinatorial inequality 
from [19, (5)] for k > 1: 

(4.21) r fc+2 ) < /3 fc (c($, r)) < J($, r fc+2 ) + g($, r; +2 ) + r'; +2 ). 

With these combinatorial inequalities in hand, we are now ready to develop 
limit theorems for the Betti numbers of the random Cech and Vietoris-Rips 
complexes (Section 4.2) as well as find thresholds for vanishing and non- 
vanishing of homology groups (Section 4.3). 

4.2. Expectations of Betti numbers. We return now to the setting of a 
stationary point process $ in M d and the sequence of finite point processes 
<£ n . Our results all follow quite easily from the corresponding limit theorems 
in Section 3, and we continue to use the notation of that section without 
further comment. 

The underlying heuristic is that in the sparse regime the order is deter- 
mined by the order of the minimal structure involved in forming homology 
groups, which is Ok for the random Vietoris-Rips complex and r& for the 
random Cech complex. Using Theorem 3.1, it is easy to see that these are 
the leading order terms and that the G and G terms in both (4.20) and 
(4.21) are, asymptotically, irrelevant. Hence, we have the following result. 
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Theorem 4.1 (Sparse regime; r n — Y 0) . Let be a stationary point 
process in M d satisfying the assumptions in Theorem 3.1 for all k > 1. Let 
r n — > 0. Further, assume that /j,q($, Tf.) > and /io(^j Ofc) > for all k > 1. 
Then, for all k > 1, 

r E{/3 fc (C(3> n ,r n ))} 
hm — ,, n , , = wo(*, Ok)- 

n ^ nr d{2k + l) f2k+2{rn) AOV 

For A; = 0, we /iai>e t/iat 

Hm E{/3 (C(^ n ,r n ))} = nm E{fl)(fl($ n ,r ra ))} = 1 

Proof. We start with the case k > 1. From Theorem 3.1, the orders of 
magnitude of the terms in (4.20) and (4.21) are as follows : 

E{J n (<I>,r fc+2 )} = e(nr< fc +V +2 (r n )), 

E{G n (^F' k+2 )} = Q(nr d n ^f k+3 (r n )), 

E{G n ($,T' k+2 )} = &(nr^f k+3 (r n )), 
E{J n ($,O k )} = 9(nr5 2fe+1 )/ 2fc+2 (r„)), 
E{G($,rJ )2fc+3 )} = 9(nrW)/ 2fc+3 (r n )) 1 < j < n k . 

Substituting these into (4.20) and (4.21), and using the fact that the limits 
of E{J n (&,T k+2 )} and E{ J n (&, O k )} are explicitly known from Theorem 
3.1, completes the proof of the theorem. 

For the case k = 0, the bounds similar to (4.20) on j3o and a similar 
argument will give the right asymptotics. □ 

Turning now to the thermodynamic regime, and applying the same argu- 
ments as in the previous proof, but using Theorem 3.3 in place of Theorem 
3.1, we find that all the terms in (4.20) and (4.21) are of order G(n). This 
leads to the following result. 

Theorem 4.2 (Thermodynamic regime: r d — > f3). Let & be a stationary 
point process in M. d satisfying the assumptions in Theorem 3.3 for all k > 1. 
Let r d ^-/3e (0,oo). Further, assume that 7 / g($,r fe ) > and^p(^,Ok) > 
for all k > 1. Then, for all k > 0, 

E{/3 fe (C($ n ,r n ))} = 0(n), E{&(12(* n ,r n ))} = G(n). 
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We note without proof that one can obtain ordering results for Betti 
numbers of a — w ordered point processes in the sparse regime analogous to 
Corollary 3.2 but not in the thermodynamic regime. 

4.3. Thresholds for homology groups. Our aim in this subsection is to 
establish results about the conditions under which different homology groups 
appear and disappear in the homology of random complexes. We shall need 
to treat Cech and Vietoris-Rips complexes separately and start with results 
on the contractibility of these. We follow these with the key results of the 
section, on thresholds for the appearance and disappearance of homology 
groups. These results also show that 7-weakly sub-Poisson point processes 
have lower vanishing thresholds for given r-components. As a corollary to 
the results on Cech complexes, we also obtain an asymptotic result on the 
behaviour of the Euler characteristic x(C($,r). 

Recall that there are a number of equivalent definitions for Euler charac- 
teristic. However, the most natural for us at this point is 

(4.22) x(C($,r) := ^(-l) fc /3 fc (C(<I>, r)). 

fc>0 

Theorem 4.3 (Contractibility of Cech complexes). Let & be a stationary 
j-weakly sub-Poisson point process. Then there exists a > such that for 
r n > Cd(logn)d, whp C(<J? n ,r n ) is contractible and x(C($ n ,r n )) = 1. 

PROOF. We start with a proof of contractibility, and then show that 
x(C($„,r n )) = 1, whp. 

As in the proof of contractibility for Poisson Cech complexes in [18, The- 
orem 6.1], we will show that, for our choice of r n , the set Uxe* ^x(f n /2) 
covers W n whp. Then, the nerve theorem of [3, Theorem 10.7] implies that 
the Cech complex is contractible whp. 

Let 7j d be the d-dimensional lattice and let Q Zi , 1 < i < N n be an enumer- 
ation of the cubes of the scaled lattice -^^ d that are fully contained within 
W n . If every cube contains a point of <£, then Uxe* Bx(r n /2) covers W n . 
By the union bound, 

N n 

r{W n ^U x ^ n B x (r n /2)} < X>W9*)=°} 

i=l 
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< ^— 1 — e K s-Sd> , 

— r d 
1 ii 

where $n) is the Poisson point process of unit intensity. All that remains is 
to choose an appropriate Cd > to complete the proof of contractibility for 
general stationary 7-weakly sub-Poisson point processes. 

As for the proof of the statement about the Euler characteristic, the fol- 
lowing obvious bound suffices. 

< P {/3 (C($ n , r n )) = 1, /3 fc (C($ n , r n )) = 0, k > 1} 
<P{ X (C($ n ,r n )) = l}. 

□ 

With these results in hand, we can now use the bounds (4.20) and (4.21) 
along with a.s. existence results of Section 3.4 to complete the picture about 
vanishing and non-vanishing of homology groups of Cech complexes and 
Vietoris-Rips complexes. 

Theorem 4.4 (Thresholds for Cech complexes). Let & be a stationary 
point process satisfying the assumptions on its joint intensities as in 
Theorems 3.1 and 3.3 for all k > 1. Then the following statements hold : 

1. Let § be a ^-weakly sub-Poisson point process. 

(a) If 

r d { k + l) fk+ 2 {rn) = o(n -l ) Qr r d = w(logJ; 

then H k (C($ n ,r n )) = 0,k > 1, whp. 

(b) Ifr* = w(log n ), then H (C(<S> n ,r n )) = 1, whp. 

2. Let & be a negatively associated point process. Further assume that 
/xo($, r fc ) > and 7,3 T k ) > 0, both for all k > 1 and all j3 > 0. 

N If 

r d [k+ i )fk+ 2 {rn) = w(n -i } and r d = 

then H k (C(® n ,r n )) ^0,k> 1, whp. 
(b) If 4 = 0(1), then H (C($ n ,r n )) ± 0, whp. 

In the absence of a contractibility result for the Vietoris-Rips complex, we 
are unable to estimate the second thresholds, where the homology groups 
vanish. Thus, we have the following less complete picture for the Vietoris- 
Rips complex. Since H (C(Q n ,r n )) = H (R(<& n ,r n )), we shall restrict our- 
selves to only H k (R(& n ,r n )), k > 1, in the following theorem. 
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Theorem 4.5 (Thresholds for Vietoris-Rips complexes). Let $ be a sta- 
tionary point process satisfying the assumptions on its joint intensities 
as in Theorems 3.1 and 3.3 for all k > 1. Then the following statements 
hold for k > 1 : 

L U rf fe+1 )/ 2fe+2 (r„) = oin- 1 ), 

then H k (R($ n ,r n )) = 0, whp. 
2. Let & be a negatively associated point process. Further assume that 
/J,o{®, Ok) > and 7 / g(<I ) , Ok) > 0, both for all k > 1 and all (3 > 0. If 

rf k+1) f k+1 (r n ) = lo^ 1 ) and r d n = 0(1), 

then H k (R($ n ,r n )) + 0, whp. 

4.4. Further results for the Ginibre process:. Using the special structure 
of the Ginibre point process, we can improve on the threshold results of the 
last section. The radius regime for contractibility of Cech complexes over 
the Ginibre point process and zeros of GEF can be made more precise, as 
more is known about void probabilities in these cases. Once we have the 
contractibility results, the upper bounds for vanishing of Betti numbers in 
this special case can be improved. 

Theorem 4.6 (Contractibility of Gech Complexes). Let $ be the Ginibre 
point process or zeros of GEF, then there exists a Cd > ( depending on the 
point process) such that for r n > Crf(logn)4, whp C(& n ,r n ) is contractible. 
Hence, H (C{$ n ,r n )) = 1, ff fc (C(* n ,r„)) = 0, k > 1 and x(C(*„,r„)) = 1 
whp for r\ = u(\/logn). 

Proof. The proof follows along similar lines as the proof of Theorem 4.3 
except that in this case, the void probabilities are of strictly lower order and 
so, the radius for contractibility as well. More precisely, we know from [2, 
Propositions 7.2.1 and Theorem 7.2.3] that for the Ginibre point process and 
zeros of GEF, - log(F {$(B (r)) = 0}) = 6(r 4 ) as r — > oo. All that remains 
is to substitute these bounds into the proof of Theorem 4.3 to derive the 
corresponding results for the Ginibre point process and zeros of GEF. □ 

For Vietoris-Rips complexes, the contractibility result for the Ginibre 
point process is a consequence of the upper bounds for the Palm void prob- 
abilities. As in the previous subsection, once we have contractibility results, 
we can obtain upper bounds for vanishing of the Betti numbers as well. 
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Theorem 4.7 (Disappearence of homology groups for Vietoris-Rips com- 
plexes). Let $ be the Ginibre point process. Then there exists a C dt k > 
such that for r n > Cdfc(logn)4, we have that whp Hk(R($ n ,r n )) = 0, k > 1. 

The proof uses the discrete Morse theoretic approach (see [14]) similar 
to that of [18, Theorem 5.1] and the reader is referred to that proof and 
the appendix in [18] for missing details. As in [18, Theorem 5.1], our proof 
actually shows topological ^-connectivity though we do not state it explicitly 
to avoid defining further topological notions. 

Proof. As the point process is simple and stationary, index the points 
in $ as X\,X2, ■■■ such that ||-X"i|| < H-X^H < H-X3II < . . .. Define V to be the 
collection of pairs of simplices (Vi, V2), V\ C V2 with 

Vi = [X h ,. . .,X ik ] and V 2 = [X io ,X h ,. . .,X h ], 

where io < i\ < ...ik- In words, we pair a simplex with another simplex of 
codimension 1 in the original simplex only if the additional point is closer 
to the origin than the rest. A simplex that is not in V is said to be a critical 
simplex. Let be the number of critical fc-simplices of V . From discrete 
Morse theory, we know that /3jt(i?($ n , r n )) < GV Thus, we only need to 
show that E{Cfc} — > for all k > 1, for an appropriate choice of radii. 
A fc-simplex X = [Aj , . . . ,Xi k ] where io < i\ < ■ ■ ■ < ik is critical only if 

$ n (n<] =0 B Xi .(r)nB o (\\x i0 \\)) = {X i0 }. 

Hence, using Campbell-Mecke formula for the first inequality, then using [18, 
Lemma 5.3] - i.e. for a critical fc-simplex as above, there exists an > 
and x € M rf such that 

k 

B x {e d ) C f]B Xi .{r)nB o {\\X i0 \\) 
3=0 

- and Lemma 6.1 for the second inequality and finally < 1 for the last 
inequality, we find that 

E{C fc } < 

< 



/ l[x is a simplex]l[||xioll < \\xi-, II < . . . < \\xi. 



xF^ n (n"B Xi (r)nB (\\x 



in 1 



0}p( fe )(x)dx. 



^ v {{k + l){e d rf-{e d rf{- + o{l))} 



W n xB x .(r) k 



/> (fc) (x)dx. 
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< nr 2k ^{k{e d rf - (e d r)\± + o(l))}. 

It is easy to see that there exists a constant C d) k > such that E{Cfc} — > 
for r n > CdkQogn)* , and so we are done. □ 

5. Morse theory for random geometric complexes. Our aim in 

this section is to present a collection of results looking at random geomet- 
ric complexes from the point of view of Morse theory. This was done in 
considerable detail for Poisson and iid point processes in [7], where it was 
shown that a Morse theoretic approach can give an intrinsically richer set 
of results than that obtained by attacking homology directly, and that the 
Morse theoretic results have, as usual, implications about Betti numbers. 

We do not intend to give full proofs here, but rather to set things up 
in such a way that parallels between the structures that have appeared in 
previous sections and those that are natural to the Morse theoretic approach 
become clear, and it becomes 'obvious' what the Morse theoretic results will 
be. Full proofs would require considerable more space, but would add little 
in terms of insight. We note, however, that this does not make the proofs 
of [7] in any way redundant. On the one hand, the results there go beyond 
what we have here (albeit only for the Poisson and iid cases) and it is their 
existence that allows us to be certain that the parallels work properly. 

We start with some definitions and a quick description of the Morse the- 
oretic setting. 

5.1. Morse Theory. Morse theory for geometric complexes is based on 
the distance function, d$ : M. d — > M + , defined by 

ds>(x) := min \\x — X\\, x <E R d . 

Note that while classical Morse theory deals with smooth functions, the 
distance function is piecewise linear, but non-differentiable along subspaces. 
The extension to the distance function of classical Morse theory is discussed 
in detail in [7], based on the definitions and results in [15], and we shall adopt 
the same approach. The main difference between smooth Morse theory and 
that based on the distance function lies in the definition of the indices of 
critical points. 

Critical points of index of the distance function are the points where 
d$ = 0, which are local and global minima, and are the points of <3?. For 
higher indices, define the critical points as follows : A point c G M rf is said to 
be a critical point with index 1 < k < d if there exists a collection of points 
X = {X\, . . . , Xk + i} C such that the following conditions hold : 
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1. <i$(c) = ||c — Xi\\ for all 1 < i < k + 1 and d$(c) < ||c — Y\\ for all 
Y G $ \X. 

2. The points -Xj, 1 < i < + 1 lie in general position, viz. they do not 
lie in a (k — l)-dimensional affine space. 

3. c G cont?°(X), where cont>°(X) denotes the interior of the convex hull 
formed by the points of X. 

Let C(X) denote the center of the unique (k — l)-dimensional sphere (if it 
exists) containing the points of X G and -R(X) be the radius of the 

ball. The conditions in the definition of critical points can be reduced to 
the following more workable conditions (see [7, Lemma 2.2]). A set of points 
X G <J>( fc+1 ) in general position generates an index k critical point if, and 
only if, 

C(X) G conv°(X) and $(fl c(X )(J?(X))) = 0. 

Our interest lies in critical points which are at most at a distance r from <£, 
viz. those for which d$(c) < r, or, equivalently .R(X) < r. The reason for 
this lies in the simple fact that 

^(M) = [jB x (r), 

and, as we already noted earlier, by the nerve theorem this is homotopy 
equivalent to the Cech compilex C(<I ) ,r). 

The following indicator functions will be required to draw the analogy 
between counting critical points and counting components of random geo- 
metric graphs. For X G <E>( fe+1 ), define 

h(X) := 1[C(X) G conv°{X)], h r (X) := 1[C(X) G conv (X)]t[R(X) < r). 

Note that these functions are translation and scale invariant, as were the hr 
functions defined for the subgraph and component counts in Section 3, viz. 
for all x G R d and y = (0, y u . . . , y k ) G R d ( k+l \ 

h r (x,x + ryi, ...,x + ry k ) = hi(y). 

This was the key property of h-p used to derive asymptotics for component 
counts. Thus, once we manage to represent the numbers of critical points 
as counting statistics of h r , the analogy with component counts is made. 
To this end, let A^($, r) be the number of critical points of index k for the 
distance function d& that are at most at a distance r from <I>. Then, 

(5.23) N k ($,r) = Yl MX)*[*(£c(X)(fl(X)))=0] 

xe*( fe + 1 ) 
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The similarity between the expression for Nk(§ n ,r n ) and J n (cf. (3.11)) 
should convince the reader that the method of proof used for component 
counts will also suffice for a derivation of the asymptotics of Morse critical 
points. Although the void indicator term is slightly different, we can use the 
fact that .R(X) < r for h r (X.) = 1 to apply the techniques of Section 3 with 
only minor changes. 

5.2. Limit theorems for expected numbers of critical points. As in previ- 
ous sections, we shall give results for the sparse and thermodynamic regimes 
separately. In the Betti number results, in the sparse regime (r n — > 0) the 
scaling factor of n for J n (see Theorem 3.1) arose from the translation in- 
variance of hr and $. The factor of r^ k ^ was due to the scale invariance 
of /ip, and the factor of f k (r n ) came from the scaling of the joint intensities 
p( k K Since h r is also translation and scale invariant, we work under the same 
assumptions on as in Theorems 3.1 and 3.4 with corresponding conditions 
h r in order to obtain asymptotics for expected number of critical points of 
the distance function. The corresponding result is as follows: 

Theorem 5.1 (Sparse regime). Let <3? be a stationary point process in 
]R. d satisfying the assumptions of Theorem 3.1 for all 1 < k < (d + 1). Let 
r n ->■ and y = (0, 3/1,3/2, • • • ,Vk)- Then, for < k < d, 

Jl fc = 0, 

Further, Var(A r / c ($ n , r n )) = 0(E{Nk(Q n , r n }) for negatively associated point 
processes and Var(A r fc($ n , r n )) = 0(E{iV/ c ($ n , r n }) for associated point pro- 
cesses. 

One point that is deserving of additional comment for the proof is that, 
as in Theorem 3.1, we can omit the void probability term in the limit by 
the following reasoning: Since R(y) < r if h r (y) = 1, y = (0, y\, . . . , yi~), we 
have that whenever h rn (y) = 1, 

1 (r|)) = o) C {<S>(B 1 (r|i?(y)) = ol , 

and the probability of the left event here (and hence the right as well) tends 
to 1. This follows from similar arguments to those in the proof of Theorem 
3.1. 



lim 



E{A^($ n ,r n } 
(rn) 



00 nr%*f 



dk fk+1 
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Turning now to the thermodynamic regime, we saw in Theorem 3.3 that 
the sole scaling factor of n for component counts is due to the translation 
invariance of hr and <£. The same remains true for mean numbers of critical 
points. 



Theorem 5.2 (Thermodynamic regime : — > ft.). Let & be a stationary 
point process in M rf satisfying the assumptions of Theorem 3.3 for all 1 < 



k < (d+ 1). Let rl 
0<k<d, 



P G (0,oo) and y = (0,y 1 ,y 2 , . . . ,y k )- Then, for 



lim 

n— >oo 



E{N k (<S> n ,r n } 
n 

F l (<S>(B (^R(y))) = 0) 



if k = 0, 



if 



Kk<d. 



Further, assume that $ is also a negatively associated point process such that 

p{$( j B c(x) (/35)) = o} >0 

for a.e. x = (0, x%, . . . , x k ) £ Bo(3[3d ) k+1 , and for all 1 < k < d. Then 
v k (<f>, (3)>0 for all0<k<d. 

Also, Var(iVfc(<I> n , r n )) = 0(E{A^.(<I> n , r n }) for negatively associated point 
processes and Va r ( (<]?„, r n )) = ^(EfiV^ln,^}) for associated point pro- 
cesses. 

As previously, the void probability needs some attention. In this case, to 
show its positivity, we again use the fact that R(y) < 1 if h\(y) = 1 and 
hence, whenever /ii(y) = 1, 

The positivity of the first event under Palm probability is guaranteed by our 
assumption via Lemma 2.1. 

Finally, we turn to a result about Euler characteristics that is not accessi- 
ble from the non-Morse theory. We already defined the Euler characteristic 
in terms of Betti numbers at (4.22), and showed in Theorem 4.3 that, in 
the connectivity regime, it is 1 with high probability. However, taking an 
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alternative, but equivalent, definition via numbers of Morse critical points, 
we can deduce its L\ asymptotics in the sparse and thermodynamic regimes 
as a corollary of the previous results in this section. The alternative defini- 
tion, which is more amenable to computations due to the bounded number 
of terms in the following sum, is 

d 

X (C($,r)) := ]T(-l) fc iW)- 

k=0 

Theorem 5.3. Let & be a stationary point process in IR d satisfying the 
assumptions of Propositions 3.1 and 3.3 for all 1 < k < (d + 1). 

(i) If r n — > 0, then 

n- 1 E{ X (C($ n ,r n ))}^l. 
(ii) Ifr^^PG (0,oo), then 



n 



-^{xiC^n, r n ))} -> 1 + ^(-l) fc ^(d>, f3). 



k=l 



(Hi) If $ is also a negatively associated point process, then the above con- 
vergences also hold in the L^-norm. 

To prove the Part (iii) of the theorem, we need variance bounds, which 
is why we require the additional assumption of negative association. For 
example, in the sparse regime, we have the following bound via the Cauchy- 
Schwarz inequality : 



E< 



n 



E< 



n 



k=l 



n 



< d ( Var($(W n )) | ^ E{ N k (<S> n , r n f) 



n- 



k=i 



n- 



d f Var($(VT n )) | ^ Var(JV fc ($ n ,r ra )) 



ii- 



k=l 



11- 



+ E 



(E{N k ($ n ,r n )}f 



k=l 



11 



The L2 convergence follows once it is noted that all the terms on right 
hand side converge to due to the variance bounds proven for negatively 
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associated point processes. A slight modification of this argument handles 
the thermodynamic regime as well. 

6. Appendix. In this section, we prove the result about Palm void 
probabilities of Ginibre point process that is used in the proof of Theorem 
4.7. The proof is due to Manjunath Krishnapur. 

Lemma 6.1. Let D = Bq(t) C M? for some r > and <I> be the Ginibre 
point process. Then for k > 1 and x S M. 2k , 

Fl{^(D) = 0} < exp{£x 2 }P {$(£>) = 0} = exp{kr 2 - r 4 (^ + o(l))}. 

Proof. We shall prove the result for k = 1. The proof for the general 
case then follows by a recursive application of the same argument. 

Let YZd be the restriction to D of the integral operator JC corresponding 
to Ginibre point process. Since the Palm process of the Ginibre point pro- 
cess is also a determinantal point process, let £c be the integral operator 
corresponding to the Palm point process restricted to D. Let Aj, i = 1, 2, . . . 
and Hi, i = 1, 2, . . . be the eigenvalues of JCd and Cd respectively. From [32, 
Theorem 6.5], we know that K,d — Lb has rank 1, and hence, by a gener- 
alisation of Cauchy's interlacement theorem, the respective eigenvalues are 
interlaced with \i> [n> Aj+i for i = 1, 2, . . .. 

Now, consider the case k = 1 and we have the following inequality : 

p i x {$ (D) = o} = nu - ^) < - A *) = t 1 - A ir lp = °i > 

i>l i>2 

where the two equalities are due to Theorem [2, 4.5.3] and the inequality 
is due to the generalisation of Cauchy's interlacement theorem described 
above. Now using Proposition [2, Proposition 7.2.1] to bound P{&(D) = 0} 
and from the fact that 1 — Ai = F {EXP(1) > r 2 } = exp{— r 2 } (see proof 
of [2, Theorem 4.7.1]), where EXP{1) is the exponential random variable 
with mean 1, we have the desired inequality for the case k = 1. □ 

7. Acknowledgements. The authors wish to thank Manjunath Krish- 
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